Estimation of Speaking Style in Speech Corpora Focusing on speech transcriptions
نویسندگان
چکیده
Recent developments in computer technology have allowed the construction and widespread application of large-scale speech corpora. To foster ease of data retrieval for people interested in utilising these speech corpora, we attempt to characterise speaking style across some of them. In this paper, we first introduce the 3 scales of speaking style proposed by Eskenazi in 1993. We then use morphological features extracted from speech transcriptions that have proven effective in style discrimination and author identification in the field of natural language processing to construct an estimation model of speaking style. More specifically, we randomly choose transcriptions from various speech corpora as text stimuli with which to conduct a rating experiment on speaking style perception; then, using the features extracted from those stimuli and the rating results, we construct an estimation model of speaking style by a multi-regression analysis. After the cross validation (leave-1-out), the results show that among the 3 scales of speaking style, the ratings of 2 scales can be estimated with high accuracies, which prove the effectiveness of our method in the estimation of speaking style.
منابع مشابه
Speaking-style dependent lexicalized filler model for key-phrase detection and verification
A task-independent ller modeling for robust keyphrase detection and veri cation is proposed. Instead of assuming task-speci c lexical knowledge, our model is designed to characterize phrases depending on the speaking-style, thus can be trained with large corpora of di erent but similar tasks. We present two implementations of the portable and general model. The dialogue-style dependent model tr...
متن کاملPronunciation variant analysis using speaking style parallel corpus
To improve the recognition accuracy for spontaneous conversational speech, we collected a corpus to study how spontaneous conversational speech differs from read style speech. The corpus consists of two parts: 1) spontaneous conversational speech and 2) read speech with the same word transcriptions as the conversational speech. In word and phone recognition experiments, it was confirmed that, f...
متن کاملThe SpeakingInfluence of Style on Lexical f Profiles in French
This study presents a comparison of French lexical fundamental frequency (f0) profiles for different speaking styles using phonemic, syllabic and lexical transcriptions as well as partof-speech annotations. Three speaking styles (broadcast news, broadcast conferences and conversations) with over 20 hours of speech were used. Syllabic word length and POS were considered as influential factors. R...
متن کاملPronunciation Variants Across Systems, Languages and Speaking Style
This contribution aims at evaluating the use of pronunciation variants across different system configurations, languages and speaking styles. This study is limited to the use of variants during speech alignment, given an orthographic transcription and a phonemically represented lexicon, thus focusing on the modeling abilities of the acoustic word models. Parallel and sequential variants are tes...
متن کاملA Lightweight on-the-fly Capitalization System for Automatic Speech Recognition
This paper describes a lightweight method for capitalizing speech transcriptions. Several resources were used, including a lexicon, newspaper written corpora and speech transcriptions. Different approaches were tested both generative and discriminative: finite state transducers, automatically built from Language Models; and maximum entropy models. Evaluation results are presented both for writt...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014